Minimizing Word Error Rate in Textual Summaries of Spoken Language

نویسندگان

  • Klaus Zechner
  • Alexander H. Waibel
چکیده

Automatic generation of text summaries for spoken language faces the problem of containing incorrect words and passages due to speech recognition errors. This paper describes comparative experiments where passages with higher speech recognizer confidence scores are favored in the ranking process. Results show that a relative word error rate reduction of over 10% can be achieved while at the same time the accuracy of the summary improves markedly.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Is Word Error Rate a Good Indicator for Spoken Language Understanding Accuracy

It is a conventional wisdom in the speech community that better speech recognition accuracy is a good indicator for better spoken language understanding accuracy, given a fixed understanding component. The findings in this work reveal that this is not always the case. More important than word error rate reduction, the language model for recognition should be trained to match the optimization ob...

متن کامل

Stochastic Language Adaptation over Time andState in Natural Spoken Dialogue

| We are interested in adaptive spoken dialogue systems for automated services. Peoples' spoken language usage varies over time for a given task, and furthermore varies depending on the state of the dialogue. Thus, it is crucial to adapt ASR language models to these varying conditions. We characterize and quantify these variations based on a database of 30K user-transactions with AT&T's experim...

متن کامل

A Real-Time Spoken-Language System for Interactive Problem Solving

SRI has developed a spoken language system to retrieve air travel planning information. Progress can be measured by comparing DARPA benchmark results in February 1992 and November 1992. Between February 1992 and November 1992, for all utterances tested, SRJ's word error rate in the ATIS speech recognition test improved from 11.0% to 9.1%. Weighted utterance error improved from 31.1% to 23.6% in...

متن کامل

Adult’s Learning Strategies for Receptive Skill Self-managing or Teacher-managing

Receptive language skill refers to answering appropriately to another person's spoken language. A lot of teachers try to develop receptive language skills in their language learners. When receptive language skills are not appropriately acquired, learners may miss significant learning opportunities resulting in delays in the development and acquisition of spoken language. The goals of this paper...

متن کامل

The SI TEDx-UM speech database: a new Slovenian Spoken Language Resource

This paper presents a new Slovenian spoken language resource built from TEDx Talks. The speech database contains 242 talks in total duration of 54 hours. The annotation and transcription of acquired spoken material was generated automatically, applying acoustic segmentation and automatic speech recognition. The development and evaluation subset was also manually transcribed using the guidelines...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000